ANDROGEN RECEPTOR PROTEINS, RECOMBINANT DNA MOLECULES CODING
FOR SUCH, AND USE OF SUCH COMPOSITIONS

This invention was made in the course of research supported in 
part by grants from the National Institutes of Health (NIH HD 16910,
HD 04466, and HD 18968). 

TECHNICAL FIELD OF THE INVENTION 

This invention relates to recombinant DNA molecules and their
expression products. More specifically, this invention relates to
recombinant DNA molecules coding for androgen receptor protein,
androgen receptor protein, and use of the DNA molecules and protein
in investigatory, diagnostic and therapeutic applications.

BACKGROUND OF THE INVENTION

The naturally occurring androgenic hormones, testosterone and
its 5α-reduced metabolite, dihydrotestosterone, are synthesized by
the Leydig cells of the testes and circulate throughout the body
where they diffuse into cells and bind to the androgen receptor
protein ("AR"). Androgens, acting through their receptor, stimulate
development of the male genitalia and accessory sex glands in the
fetus, virilization and growth in the pubertal male, and maintenance
of male virility and reproductive function in the adult. The
androgen receptor, together with other steroid hormone receptors,
constitutes a family of trans-acting transcriptional regulatory
proteins that control gene transcription through interactions with
specific gene sequences.

When prostate cancer is found to be confined to the prostate
gland, the treatment of choice is surgical removal. However, 50 to
80% of prostate cancer patients already have metastases at the time
of diagnosis. Most of their tumors (70 to 80%) respond to the
removal of androgen by castration or by suppression of luteinizing
hormone secretion by the pituitary gland using a gonadotropin
releasing hormone analogue alone or in combination with an
anti-androgen. The degree and duration of response to this
treatment is highly variable (10% live [...] < 3 years, and 10%
live > [...] years) [...] without androgen stimulation but
ultimately [...] when they might be more likely to respond.

Studies on androgen receptors in prostate cancer have suggested
that a positive correlation may exist between the presence of
[...] dependence on [...] estrogen receptor [...] estrogen
withdrawal). [...] methodological problems in the measurement of
androgen receptor have prevented the routine use of [...] in human
[...] [3H]-labeled androgen. These assays have [...]. A further
problem is that biopsy samples are quite small, making it difficult
to obtain sufficient tissue for [3H]androgen binding [...].
Moreover, prostate cancer is heterogeneous with respect to cell
types. Thus within a single biopsy sample there is likely to be an
uneven distribution of cells containing androgen receptors.

Development of the male phenotype and maturation of male
reproductive function are dependent on the interaction of androgenic
hormones with the androgen receptor protein and the subsequent
function of the receptor as a trans-acting [...]. It has become
well established over the past [...] abnormalities ranging from
genetic males (46, XY) [...] phenotype to phenotypically normal
males [...] of the structural gene [...]. Analysis [...]
regulation of androgen [...] to a better understanding of the
regulation [...] expression and of the mechanism of androgen action
[...] male sexual development.

The [...], also [...] as testicular feminization, [...] by an
inability to respond to androgen due to a defect in the androgen
receptor, the protein that mediates the action of androgen within
the [...]. Androgen insensitivity is an inherited X-linked trait
[...].

Because of the inability to respond to [...], the [...] genitalia
remain in the female phenotype with normal clitoris and labia. A
small vagina develops from the urogenital sinus and ends in a blind
pouch. At puberty feminization with breast development [...]
testosterone concentrations are equal to or greater than [...] the
normal male.

Incomplete forms of the androgen insensitivity syndrome include
a spectrum of phenotypes resulting from varying degrees of
incomplete androgen responsiveness. At one extreme, individuals
have mild enlargement of the clitoris and sparse pubic hair. The
opposite extreme is characterized by more complete masculinization
with varying degrees of hypospadias deformity but predominantly a
male phenotype. It has been reported that some adult men with
severe oligospermia or azoospermia who are otherwise normal have
defects in the androgen receptor. These may include as many as 10%
of infertile males.
The genetic defect eliciting this range of abnormalities is
thought to be a single biochemical event at the level of the gene
for the androgen receptor. The androgen receptor is a high affinity
androgen binding protein that mediates the effects of testosterone
and dihydrotestosterone by functioning as a trans-acting inducer of
gene expression. For proper male sexual development to occur there
is a requirement for androgen and its receptor at a critical time
during embryogenesis and during puberty. The majority of
individuals with the androgen insensitivity syndrome have a history
of affected family members, although about a third are thought to
represent new mutations of this X-linked disorder. The incidence
ranges from 1 in 20,000 to 60,000 male births.

In studies of families with clinical evidence of the androgen
insensitivity syndrome, four main categories were recognized that
range from the most severe, complete absence of receptor binding
activity in a genetic male with female phenotype, to qualitatively
normal receptor in affected individuals. Second in severity are
affected individuals with qualitatively abnormal androgen binding by
receptor present in normal levels. Examples include the failure of
sodium molybdate (a reagent often used in studies on steroid
receptors) to stabilize the receptor of affected individuals when
molybdate is known to stabilize the wild-type receptor. Lability of
the receptor under conditions that normally cause transformation has
also been reported. A third group expresses a decreased amount of
receptor with wild-type in vitro binding characteristics. The final
grouping contains those androgen insensitivity patients in
whom no abnormality in receptor is detected. In a recent study of
this form of the syndrome, the androgen receptor was as capable of
binding oligonucleotides as the wild-type receptor. Indeed, with
techniques available until only recently, it has been [...]
in certain cases to document an androgen receptor defect in affected
individuals.

Experimental methods used in assessing receptor defects in the
past have relied on the ability of receptor to bind androgen with
high affinity. The limitation of this methodology is that it is not
possible to distinguish between the lack of expression of the
receptor and loss of androgen binding activity. An example of how
inadequate methodology complicates diagnosis is the absence of
detectable receptor binding activity in patients who are partially
virilized. It is theoretically possible for a mutation to occur
which allows the receptor with defective androgen binding activity
to induce gene transcription. Biologically active forms
of the glucocorticoid receptor that lack steroid binding activity
but retain the DNA binding domain have been demonstrated using
genetically engineered mutants.
Purification of the androgen receptor has been difficult to
accomplish due to its low concentration and high degree of
instability. Reported attempts at purification using either
conventional methods of column chromatography or steroid-affinity
chromatography have yielded insufficient amounts of receptor protein
to allow even the preparation of monoclonal antibodies.

An early report on the partial purification of the androgen
receptor was disclosed by Mainwaring et al. in "The use of
DNA-cellulose chromatography and isoelectric focusing for the
characterization and partial purification of steroid-receptor
complexes," Biochem J. 134, 113-127 (1973). They used DNA-cellulose
chromatography and isoelectric focusing to isolate the receptor from
rat ventral prostate and determined its physicochemical properties.
This group was among the first to attempt the use of steroid
affinity chromatography in conjunction with conventional
chromatography, using the affinity label
17β-bromoacetoxytestosterone in receptor purification (See
Mainwaring et al., "Use of the affinity label
17β-bromoacetoxytestosterone in the purification of androgen
receptor protein," Perspectives in Steroid Receptor [...]
(1980)). Partial purification of androgen receptor has also been
reported from other tissue sources, such as [...]
(See Foekens et al., Molecular Cellular Endocr. 23, 173-186 (198 ))
and Foekens et al., "Purification of the androgen receptor of sheep
seminal vesicles," Biochem Biophys Res Comm. 104, 1279-1286
(1982)). [...] partially purified receptor displayed characteristics
[...] proteolyzed receptor, but a purification of 2,000 fold was
reported with a recovery of 33% (See Foekens et al., "Purification
of the androgen receptor of sheep seminal vesicles," Biochem Biophys
Res Comm. 104, 1279-1286 (1982)). Later attempts at purification
continued to combine steroid affinity chromatography with
conventional techniques, reportedly achieving significant
purification, but recoveries too low for further analysis (See Chang
et al., "Purification and characterization of androgen receptor from
steer seminal vesicle," Biochemistry 21, 4102-4109 (1982), Chang et
al., "Purification and characterization of the androgen receptor
from rat ventral prostate," Biochemistry 22, 6170-6175 (1983) and
Chang et al., "Affinity labeling of the androgen receptor in rat
prostate cytosol with
17β-[(bromoacetyl)oxy]-5-alpha-androstan-3-one," Biochemistry 23,
2527-2533 (1984)). More recent studies examine the effectiveness of
a variety of immobilized androgens for their ability to bind the
androgen receptor (See De Larminat et al., "Synthesis and evaluation
of immobilized androgens for affinity chromatography in the
purification of nuclear androgen receptor," The Prostate 5, 123-140
(1984) and Bruchovsky et al., "Chemical demonstration of nuclear
androgen receptor following affinity chromatography with immobilized
ligands," The Prostate 10, 207-222 (1987)). Despite these efforts,
the receptor has not been purified to homogeneity and
the quantities of purified androgen receptor obtained have been
insufficient for the production of antisera.

Clinical assays for the androgen receptor now include several
methods. The most common is the binding of tritium-labeled hormone
and measurement of binding using a charcoal adsorption assay.
Either a natural androgen, such as dihydrotestosterone, or a
synthetic androgen, such as mibolerone or methyltrienolone (R1881),
can be used. An advantage of the latter in human tissue is that it
is not significantly metabolized and does not bind to the
binding protein, sex steroid binding globulin. A limitation of
radioisotope labeling of receptor is interference caused by
endogenous androgen. Although exchange assays for the androgen
receptor have been described (See Carroll et al., J Steroid Biochem
21, 353-359 (1984) and Traish et al., J Steroid Biochem 23, 405-413
(1985)), their effectiveness is limited by the slow kinetics of
dissociation of the endogenous receptor-bound androgen.

Another method used to assess receptor status is
autoradiography. In this method, disclosed in Barrack et al.,
"Current concepts and approaches to the study of prostate cancer,"
Progress in Clinical and Biological Research, 239, 155-187 (198 ),
the radioactively labeled androgen is incubated with slide-mounted
tissue sections of small tissue biopsy specimens which are then
frozen, sectioned and fixed. Nuclear localization of radioactivity
is detected by exposure of tissue sections to x-ray film. This
technique requires considerable technical expertise, is labor
intensive, and requires extended periods of exposure time. It is
therefore of limited usefulness in general clinical assays. Another
problem is high levels of background signal, i.e. a high
noise/signal ratio, making it difficult to distinguish
receptor-bound nuclear radioactivity from unbound radioactivity
distributed throughout the cells.

WO 87/05049 (Shine) discloses a method for the production of
purified steroid receptor proteins, specifically estrogen receptor
proteins, through the expression of recombinant DNA encoding
such proteins in eukaryotic host cells. However, the reference does
not disclose the sequence for androgen receptor protein, nor does it
disclose a method for obtaining such a sequence.

SUMMARY OF THE INVENTION

The present invention provides a DNA sequence characterized by
a structural gene coding for a polypeptide having substantially the
same biological activity as androgen receptor protein. A DNA
sequence encoding androgen receptor protein or a protein having
substantially the same biological activity as androgen receptor
protein is also provided. DNA sequences may be obtained from cDNA
or genomic DNA, or prepared using DNA synthesis techniques.

The invention further discloses cloning vehicles comprising a
DNA sequence comprising a structural gene encoding a polypeptide
having substantially the same biological activity as androgen
receptor protein. Cloning vehicles comprising a DNA sequence
encoding androgen receptor protein or a protein having substantially
the same biological activity as androgen receptor protein are also
provided. The cloning vehicles further comprise a promoter sequence
upstream of and operatively linked to the DNA sequence. In general
the cloning vehicles will also contain a selectable marker, and,
depending on the host cell used, may contain such elements as
regulatory sequences, polyadenylation signals, enhancers and RNA
splice sites.

The invention further provides cells transfected or transformed
to produce androgen receptor protein or a protein having
substantially the same biological activity as androgen receptor
protein.

A further aspect of the invention provides a purified androgen
receptor protein and purified polypeptides and proteins having
substantially the same biological activity as androgen receptor
protein, and methods for producing such proteins and polypeptides.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows a comparison of DNA-binding domains of the human
androgen receptor (hAR) with members of the nuclear receptor
family. (A) is a comparison of oligo A nucleotide sequence with
sequences of hAR and other nuclear receptors: hPR, human
progesterone receptor; hMR, human mineralocorticoid receptor; hGR,
human glucocorticoid receptor; hER, human estrogen receptor; hTR,
human thyroid hormone receptor; hRAR, human retinoic acid receptor.
Chromosomal locations are shown in parentheses at the left.
Nucleotide identity between oligo A and hAR is indicated with an
asterisk. The percent homology with oligo A is in parentheses at
the right of each sequence. (B) shows the structure of
fibroblast clone ARHFL1 (human fibroblast clone [1]). Nucleotide
residues are numbered from the 5'-terminus. Restriction
endonuclease sites were determined by mapping or were deduced from
DNA sequence. The TGA translation termination codon, determined by
comparison with hPR, hMR and hGR, follows a long open reading frame
containing sequences homologous to those of other steroid
receptors. Arrows indicate exon boundaries in genomic clone X05AR.
The hatched area is the putative DNA binding domain. (C) shows a
comparison of amino acid sequences of the AR DNA-binding domain with
sequences of the nuclear receptor family. AR amino acid sequence
was deduced from nucleotide sequence of clone ARHFL1 and is numbered
beginning with the first conserved cysteine residue (+). Amino acid
numbers in parentheses at the left indicate the residue number of
the first conserved cysteine from the references indicated above.
Percent homology with hAR is indicated in parentheses on the right.
The region of the DNA-binding domain from which the oligo A sequence
was derived is underlined in hAR. Coding DNA of residues 1 to 31 is
contained within genomic clone X05AR. Abbreviations in addition to
those described above are cVDR, chicken vitamin D receptor, and
v-erbA, erb A protein from avian erythroblastosis virus.
Abbreviations for amino acid residues are: A, Ala; C, Cys; D, Asp;
E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met;
N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and
Y, Tyr.

Figure 2 illustrates the steroid binding properties of
expressed AR cDNA. (A) shows the structure of pCMVAR in the
expression vector pCMV containing the human cytomegalovirus (CMV)
promoter of the immediate early gene, poly(A) addition-transcription
terminator region of the human growth hormone gene (hGH poly A),
SV40 origin of replication (SV40 Ori), and a polylinker [...]
insertion of cDNAs. [...] plasmid pTEBR contains the ampicillin
[...]. (B) shows saturation analysis of [3H]dihydrotestosterone
binding in extracts of pCMVAR-transfected COS M6 cells. Portions of
cytosol (0.1 ml, 0.3 mg/ml protein) were incubated overnight at 4°C
with increasing concentrations of 3H-labeled hormone and analyzed by
charcoal adsorption. Nonspecific binding increased from 18% to 37%
of total bound radioactivity. (C) shows a Scatchard plot analysis
of [3H]dihydrotestosterone binding. Error estimation was based on
linear regression analysis (r = [...]). (D) illustrates the
competition of unlabeled steroids for binding of 5 nM
[3H]dihydrotestosterone in transfected COS M6 cell extracts.
Unlabeled steroids were added at 10- and 100-fold excess of labeled
hormone. Specific binding was determined as previously described.

Figure 3 is a compiled clone map of the human androgen
receptor. The map shows the structure of the human androgen
receptor gene and the relative positions of the nucleic acid
sequences contained in the cDNA probes [A], [B], [C] and [D], human
fibroblast clone [1], human epididymis clones [1] and [5], human
genomic clones [1], [2], [3], [4] and [5], and rat epididymis clones
[...].

Figure 4 is a photograph showing chromosome localization of the
[...] gene on Southern blots of DNA from human cells containing
multiple X chromosomes and mouse or hamster cells with X-autosome
translocation chromosomes.


Figure 5 shows the complete single strand sequence (5085 bases)
of the human androgen receptor and the deduced amino acid sequence.
No intron sequence is [...].

Figure 6 shows the [...] sequence ([...]260 bases) of the rat
androgen receptor and the deduced amino acid sequence.

Figure 7 is a photograph of [...] AR peptide NH2-Asp-His-[...]
avidin-biotin peroxidase [...] polymorphisms [...] analysis in
[...] syndrome patients.


Nucleotide

A monomeric unit of DNA or RNA consisting of a sugar moiety
(pentose), a phosphate, and a nitrogenous heterocyclic base. The
base is linked to the sugar moiety via the glycosidic carbon (1'
carbon of the pentose), and that combination of base and sugar is
called a nucleoside. The base characterizes the nucleotide. The
four DNA bases are adenine ("A"), guanine ("G"), cytosine ("C") and
thymine ("T"). The four RNA bases are A, G, C and uracil ("U"). A
and G are purines, abbreviated to R, and C, T, and U are
pyrimidines, abbreviated to Y.
DNA Sequence

A linear series of nucleotides connected one to the other by
phosphodiester bonds between the 3' and 5' carbons of adjacent
pentoses.
Codon

A DNA sequence of three nucleotides (a triplet) which encodes
through mRNA an amino acid, a translational start signal or a
translational termination signal. For example, the nucleotide
triplets TTA, TTG, CTT, CTC, CTA and CTG encode the amino acid
leucine ("Leu"), TAG, TAA and TGA are translational stop signals and
ATG is a translational start signal.
Reading Frame

The grouping of codons during translation of mRNA into amino
acid sequences. During translation the proper reading frame must be
maintained. For example, the sequence GCTGGTTGTAAG may be
translated in three reading frames or phases, each of which affords
a different amino acid sequence:

GCT GGT TGT AAG - Ala-Gly-Cys-Lys
G CTG GTT GTA AG - Leu-Val-Val
GC TGG TTG TAA G - Trp-Leu-(STOP)
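
The three reading frames in the example above can be reproduced
with a short script. The following is a minimal sketch in Python,
assuming the standard genetic code; the function name
`translate_frame` and the one-letter amino acid output (with `*`
marking a stop codon) are illustrative choices and are not part of
the original disclosure.

```python
# Build the standard genetic code table (DNA codons -> one-letter amino acids).
# '*' denotes a translational stop signal.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_frame(dna: str, frame: int) -> str:
    """Translate `dna` starting at offset `frame` (0, 1 or 2).

    Incomplete trailing codons are ignored; translation halts after
    the first stop codon, which is reported as '*'.
    """
    protein = []
    for pos in range(frame, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3].upper()]
        protein.append(residue)
        if residue == "*":
            break
    return "".join(protein)


if __name__ == "__main__":
    sequence = "GCTGGTTGTAAG"
    for frame in range(3):
        print(frame, translate_frame(sequence, frame))
```

In one-letter code the three frames read AGCK (Ala-Gly-Cys-Lys),
LVV (Leu-Val-Val) and WL* (Trp-Leu-STOP), matching the listing
above.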
Polypeptide 

A linear series of amino acids connected one to the other by
peptide bonds between the α-amino and carboxy groups of adjacent
amino acids.
Genome

The entire DNA of a substance. It includes inter alia the
structural genes encoding the polypeptides of the substance, as
well as operator, promoter and ribosome binding and interaction
sequences including sequences such as the Shine-Dalgarno sequences.

Structural Gene 

A DNA sequence which encodes through its template or messenger 
RNA ("mRNA") a sequence of amino acids characteristic of a specific 

polypeptide. 

Transcription 

The process of producing mRNA from a structural gene. 
Translation 

The process of producing a polypeptide from mRNA.
Expression

The process undergone by a structural gene to produce a 
polypeptide. It is a combination of transcription and translation. 

Plasmid

A non-chromosomal double-stranded DNA sequence comprising an
intact "replicon" such that the plasmid is replicated in a host
cell. When the plasmid is placed within a unicellular organism, the
characteristics of that organism are changed or transformed as a
result of the DNA of the plasmid. For example, a plasmid carrying
the gene for tetracycline resistance (TetR) transforms a cell
previously sensitive to tetracycline into one which is resistant to
it. A cell transformed by a plasmid is called a "transformant".
Phage or Bacteriophage 

Bacterial viruses, many of which include DNA sequences
encapsidated in a protein envelope or coat ("capsid"). In a
unicellular organism a phage may be introduced as free DNA by a
process called transfection.

Cloning Vehicle 

A plasmid, phage DNA or other DNA sequences which are able to 
replicate in a host cell, characterized by one or a small number of 
endonuclease recognition sites at which such DNA sequences may be 
cut in a determinable fashion without attendant loss of an essential 
biological function of the DNA, e.g., replication, production of 
coat proteins or loss of promoter or binding sites, and which 
contain a marker suitable for use in the identification of 
transformed cells, e.g., tetracycline resistance or ampicillin
resistance. A cloning vehicle is often called a vector. 
Cloning 

The selection and propagation of a single species. 
Recombinant DNA Molecule 

A hybrid DNA sequence comprising at least two nucleotide 
sequences, the first sequence not normally being found together in 
nature with the second.
Expression Control Sequence

A DNA sequence of nucleotides that controls and regulates
expression of structural genes when operatively linked to those
genes.

To attain the objects of this invention it was necessary to
determine the amino acid sequence and the DNA sequence of the
structural gene encoding androgen receptor protein. One
conventional approach would involve starting with the purified
androgen receptor protein. However, as described above, significant
amounts of the protein for such purposes have not been obtained.

An alternative approach to circumvent the overwhelming 
difficulty of androgen receptor protein purification is direct 
isolation of the DNA encoding the messenger RNA for androgen 

receptor protein. 

Our strategy for isolating AR DNA was based on evidence that 
the AR gene is X-linked and that no other steroid receptor gene is 
located on the X chromosome. Sequence data are available from cDNAs
for glucocorticoid, estrogen, progesterone, mineralocorticoid and
vitamin D receptors. Comparison of the derived amino acid sequences 
has revealed a central region of high cysteine content which was 
found also in the v-erb A oncogene product recently identified as 
the thyroid hormone receptor. Within this 61-63 amino acid region 
is an arrangement of 9 cysteine residues that are absolutely 
conserved among steroid receptors thus far characterized. The 
overall homology among sequences in this conserved region ranges 
between 40 and 90%. We assumed that AR would resemble other members 
of the steroid receptor family in the conserved DNA-binding domain. 

A human X chromosomal library was screened with the synthetic
oligonucleotide probe A (Oligo A sequence = 5' CTT TIG AAG AAG
ACC TTA CAG CCC TCA CAG GT 3') of Figure 1(A) designed as a
consensus sequence from the conserved sequence of the DNA-binding
domain of other steroid receptors. Screening the library with the
oligo A probe resulted in several recombinants whose inserts were
cloned into bacteriophage M13 DNA and sequenced. One recombinant
clone (Charon 35 X05AR) (human genomic clone [1]) contained a
sequence similar to, yet distinct from, the DNA-binding domains of
other steroid receptors. It had 84% sequence identity with oligo A,
while other receptor DNAs were 78% to 91% homologous with the
consensus oligonucleotide.
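
Percent identity figures such as the 84% quoted above are obtained
by counting matching positions between two aligned sequences. The
following is a minimal sketch in Python, assuming an ungapped
alignment of two equal-length sequences; the function name
`percent_identity` and the two short sequences are hypothetical and
are not the oligo A or receptor sequences.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of aligned positions at which two sequences match."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    probe = "CTTGAAGAAG"    # hypothetical 10-mer
    target = "CTTGAAGCAG"   # hypothetical target differing at one position
    print(f"{percent_identity(probe, target):.0f}% identity")
```

As an arithmetic illustration only, if the probe is taken to be 32
nucleotides long as printed above, 27 matching positions would
round to 84% identity.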

From the nucleotide sequence just 5' of the DNA binding domain,
oligonucleotide probe B (Oligo B sequence = 5' GGA CCA TGT TTT GCC
CAT TGA CTA TTA CTT TCC ACC CC 3') was synthesized and used to
screen bacteriophage lambda gt11 cDNA libraries from human
epididymis and cultured human foreskin fibroblasts. Recombinant
phage (unamplified) screened with this oligonucleotide by in situ
hybridization revealed one positive clone in each library. The
epididymal clone (gt11 ARHEL1) (human epididymis clone [1]) contained
the complete DNA-binding domain and approximately 1.5 kb of upstream
sequence, whereas the fibroblast clone (gt11 ARHFL1) (human
fibroblast clone [1]) shown in Figure 1(B) contained the DNA-binding
domain and 1.5 kb of downstream sequence. The DNA-binding domains
of the cDNA isolates were identical to that of the genomic exon
sequence.

Transient expression in monkey kidney cells (COS M6)
demonstrated that the human foreskin fibroblast cDNA fragment
encodes the steroid-binding domain of hAR. A DNA fragment
(ARHFL1H-X) extending 5' to 3' from the Hind III site within the
putative DNA-binding domain through the stop codon (TGA) was cloned
into pCMV as shown in Figure 2(A). Expression was facilitated by
adding to the 5' end a consensus translation initiation sequence
containing the methionine codon (ATG) in reading frame.
Transfection of the recombinant construct produced a protein with
high affinity for [3H]dihydrotestosterone (Figure 2(C)) that was
saturable at physiological levels of hormone (Figure 2(B)). The
binding constant [Kd = 2.7 (± 1.4) x 10^-10 M] was nearly identical
to that of native AR. The level of expressed protein, 1.3 pmol per
milligram of protein, was 20 to 60 times greater than that in male
reproductive tissues. Mock transfections without plasmid or
transfections with plasmid DNA lacking the AR insert yielded no
specific binding of dihydrotestosterone. Figure 2(D) shows that
steroid specificity was identical to that of native AR, with highest
affinity for dihydrotestosterone and testosterone, intermediate
affinity for progesterone and estradiol, and low affinity for
cortisol.
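
The binding constant and receptor level reported above come from
saturation binding data analyzed by a Scatchard plot (Figure 2(C)).
The following is a minimal sketch of that kind of calculation in
Python, assuming a single class of binding sites: bound/free is
regressed on bound, the slope is -1/Kd and the x-intercept is the
number of binding sites (Bmax). The data points are synthetic,
generated from the Kd and receptor level quoted in the text purely
for illustration; they are not the measured data, and the function
names are hypothetical.

```python
def one_site_bound(free_nM, kd_nM, bmax):
    """Specific binding for a single class of sites: B = Bmax*F/(Kd+F)."""
    return bmax * free_nM / (kd_nM + free_nM)


def scatchard_fit(free_nM, bound):
    """Least-squares line through the points (bound, bound/free).

    Returns (Kd, Bmax), where Kd = -1/slope and Bmax is the
    x-intercept of the fitted line.
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free_nM)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = -intercept / slope
    return kd, bmax


if __name__ == "__main__":
    # Illustrative values only: Kd = 0.27 nM (2.7 x 10^-10 M),
    # Bmax = 1.3 pmol per mg protein.
    free = [0.05, 0.1, 0.25, 0.5, 1.0, 2.5, 5.0]   # free hormone, nM
    bound = [one_site_bound(f, 0.27, 1.3) for f in free]
    kd, bmax = scatchard_fit(free, bound)
    print(f"Kd = {kd:.2f} nM, Bmax = {bmax:.2f} pmol/mg protein")
```

With noise-free synthetic input the fit simply recovers the values
used to generate the data; with real measurements the scatter about
the regression line gives the error estimate.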

Figure 3 is a clone map compiled to show the human androgen
receptor gene and the nucleic acid sequences in the cDNA clones,
human genomic clones, human fibroblast clones, human epididymis
clones, and rat epididymis clones. Human fibroblast clone [1]
extended through the stop codon or the C-terminal end of the
androgen receptor protein. To isolate and elucidate the sequence of
the 5' or N-terminal end of the androgen receptor protein, we used an
EcoRI/SstI fragment (EcoRI site was from the linker) from the 5' end
of human epididymis clone [1] as a probe (cDNA probe [A]) to
rescreen the human X chromosomal library by standard techniques. By
these techniques, human genomic clone [2] was isolated and in turn
used as a probe to rescreen a human epididymis library and isolate
human epididymis clone [5]. The N-terminal sequence was elucidated
along with the 5' flanking sequence of the androgen receptor protein
and gene. Human genomic clones [3], [4] and [5] for the sequence 3'
of human genomic clone [1] were obtained using cDNA probes B [a Hind
III/EcoRI fragment] and C [an EcoRI fragment], by screening and
isolating by standard techniques.

Two rat clones, rat epididymis clones [1] and [2], were
isolated from a rat epididymis cDNA library using as probes the
complete human epididymis clone [1] and an EcoRI/PstI fragment, cDNA
probe [D], respectively. These rat clones contained the entire
protein coding sequence for the rat androgen receptor, plus flanking
5' and 3' untranslated sequences which were used to confirm the
sequence of the human androgen receptor.

The complete double-stranded sequence encoding the human
androgen receptor protein was determined and is set forth in Figure
4. The single-stranded DNA sequence encoding human androgen
receptor protein along with the amino acid sequence which it codes
for are set forth in Figure 5. The single stranded DNA sequence and
the amino acid sequence for the rat androgen receptor protein are set
forth in Figure 6.

Recombinant DNA clones human fibroblast clone [1] isolated from
human foreskin fibroblast cDNA gt11 expression library, and human
epididymis clones [1] and [5] isolated from human epididymis cDNA
gt11 expression library were deposited in the American Type Culture
Collection with accession numbers ATCC # ____, ATCC # ____ and
ATCC # ____ respectively. Human genomic clones [1], [2], [3], [4]
and [5] which were isolated from human X chromosome lambda Charon 35
library available as ATCC # 57750 have been deposited with the
American Type Culture Collection with accession numbers ATCC
# ____, ATCC # ____, ATCC # ____, ATCC # ____ and ATCC
# ____ respectively.

A wide variety of host-cloning vehicle combinations may be
usefully employed in cloning the double stranded DNA disclosed
herein. For example, useful cloning vehicles may include
chromosomal, non-chromosomal and synthetic DNA sequences such as
various known bacterial plasmids and wider host range plasmids such
as pCMV and vectors derived from combinations of plasmids and phage
DNA such as plasmids which have been modified to employ phage DNA
expression control sequences. Useful hosts may include bacterial
hosts, yeasts and other fungi, animal or plant hosts, such as
Chinese Hamster Ovary cells (CHO), or monkey kidney cells (COS M6),
and other hosts. The particular selection of host-cloning vehicle
combinations may be made by those of skill in the art after due
consideration of factors such as the source of the DNA, i.e. genomic
or cDNA.

Cloning vehicles for use in carrying out the present invention
will further comprise a promoter operably linked to the DNA sequence
encoding the androgen receptor protein. In some instances it is
preferred that cloning vehicles further comprise an origin of
replication, as well as sequences which regulate and/or enhance
expression levels, depending on the host [...]

Techniques for [...] (see, for example, Maniatis et [...])
[...] expressing foreign genes in the host cell. [...] Saccharomyces
[...] receptor protein.

Higher eukaryotic cells can also serve as host cells in
carrying out the present invention. Cultured [...] are
preferred. Cloning vehicles for use in mammalian cells
comprise a promoter capable of [...] a
foreign gene introduced into a mammalian cell. [...]
the expression vector [...] can be [...] a
heterologous gene.
A selectable marker, such as a gene that confers a selectable
[...] confer resistance to drugs such as neomycin, hygromycin and
methotrexate. Selectable markers may be introduced into the cell on
a separate plasmid at the same time as the gene of interest, or they
may be introduced on the same plasmid.

The copy number of the integrated gene sequence can be
increased through amplification by using certain selectable
markers. Through selection, expression levels may be substantially
increased.

Androgen receptor proteins may be purified from the host cells
or cell media according to the present invention using techniques
well known to those in the art. Such proteins may be utilized to
produce monoclonal or polyclonal antibodies according to the
techniques described below.

The techniques of this invention offer considerable advances
over existing technology for measurement of androgen receptor.
Utilizing proteins and peptides containing the disclosed sequences,
monoclonal or polyclonal antibodies can be produced for use as
immunochemical reagents in immunodiagnostic assays. For example,
radioimmunoassays and ELISA assays can be developed utilizing these
reagents which will allow detection and quantification of androgen
receptor in the presence of endogenous androgen, since such androgen
will not interfere with antibody binding to the receptor.

Immunocytochemistry utilizing our reagents enables
determination and quantification of the cellular distribution of the
androgen receptor in tumor tissues, which are often heterogeneous in
composition. This assay offers great potential for diagnostic
evaluation of prostate cancer to determine responsiveness to
androgen withdrawal therapy.

In addition, the antibodies produced using the disclosed amino
acid sequences can also be used in processes for the purification of
androgen receptor protein produced by the above methods. One such
purification process is disclosed in Logeat, F., et al.,
Biochemistry, vol. 24 (1985), pp. 1029-1035, which is incorporated by
reference herein.

Androgen receptor proteins and polypeptides synthesized from
the deduced amino acid sequence can be used as immunogens for the
preparation of antibodies to the androgen receptor. Peptides for
such use range in length from about 3 to about 958 amino acids
and are preferably from about 15 to about 30 amino acids in
length. Shorter peptides may have significant sequence homology to
other steroid receptor proteins and larger peptides may contain
multiple antigenic determinants; these properties could result in
antibodies with cross-reactivities to other steroid receptor
proteins.

Peptides can be synthesized from amino acid sequences in the
NH2-terminal region, the DNA-binding domain, and the
carboxyl-terminal steroid binding domain. Peptide selection will be
based on hydropathic plots, selecting hydrophilic regions that are
more likely exposed on the receptor surface. For diagnostic
purposes preferred sequences will be selected from the
NH2-terminal region where there is the least homology with other
steroid receptor proteins.
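
The selection of hydrophilic regions from a hydropathic plot can be illustrated with a short sketch. This is not the procedure used in this disclosure: the text does not name a hydropathy scale, so the Kyte-Doolittle scale is assumed here, and the function names, window sizes and demonstration sequence are hypothetical.

```python
# Kyte-Doolittle hydropathy values (assumed scale; the text names none).
# Positive values are hydrophobic, negative values are hydrophilic.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(seq, window=7):
    """Return the mean hydropathy of every full window along seq."""
    seq = seq.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and len(seq)")
    scores = [KD[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


def most_hydrophilic_window(seq, window=15):
    """Return (start index, peptide, mean score) of the lowest-scoring window."""
    seq = seq.upper()
    profile = hydropathy_profile(seq, window)
    start = min(range(len(profile)), key=profile.__getitem__)
    return start, seq[start:start + window], profile[start]


if __name__ == "__main__":
    # Hypothetical sequence: a charged stretch flanked by leucines.
    demo = "LLLLLKDEKRLLLLL"
    print(most_hydrophilic_window(demo, window=5))
```

The window with the lowest mean score is the most hydrophilic candidate; in practice such a candidate would still need to be checked for homology with other steroid receptor proteins, as discussed above.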

Peptides for use as immunogens can be synthesized using
techniques available to one of ordinary skill in the art. For
example, peptides corresponding to androgen receptor sequences can
be synthesized using tBOC chemistry on a Biosearch Model 9500
peptide synthesizer. Peptide purity is assessed by high pressure
liquid chromatography. Peptides can be conjugated to keyhole limpet
hemocyanin through cysteine residues using the coupling agent
m-maleimido-benzoyl-N-hydroxysuccinimide ester. One can also
prepare resin-bound peptides utilizing the p-(oxymethyl benzamide)
handle to attach the C-terminal amino acid to solid-phase resin
support.

Proteins and peptides of this invention can be utilized for the
production of polyclonal or monoclonal antibodies. Methods for
production of such antibodies are known to those of ordinary skill
in the art and may be performed without undue experimentation. One
method for the production of monoclonal antibodies is described in
Kohler, G., et al., "Continuous Culture of Fused Cells Secreting
Antibody of Predefined Specificity," Nature, vol. 256 (1975), p.
495, which is incorporated herein by reference. Polyclonal
antibodies, by way of example, can be produced by the method
described below.

Peptide conjugates or resin-bound peptides can be injected into
rabbits according to the procedure of Vaitukaitis et al., J. Clin.
Endocrinol. Metab., 33, 988-991 (1971) using a standard immunization
schedule. Antisera titers can be determined in the ELISA assay.

For example, one androgen receptor sequence,
NH2-Asp-His-Val-Leu-Pro-Ile-Asp-Tyr-Tyr-Phe-Pro-Pro-Gln-Lys-Thr,
in the 5' region upstream from the DNA-binding domain, was used to
raise antisera in rabbits. The antisera react selectively at a
dilution of 1 to 500 with the androgen receptor both in its
untransformed 8-10S form and in its 4-5S transformed form. Receptor
sedimentation on sucrose gradients increases from 4 to 8-10S in the
presence of antiserum at high ionic strength and from 8-10S to
11-12S at low ionic strength sucrose gradients. In the ELISA
reaction against the peptide used as immunogen, reactivity was
detectable at 1 to [...]000 dilution. This antiserum at a dilution of
1 to 3000 was found effective in staining nuclear androgen receptor
in rat prostate and other male accessory sex glands (see Figure 7).

Our invention provides new molecular probes comprising 
complementary DNA sequences derived from the deduced sequences 
encoding the androgen receptor for diagnostic purposes. Such probes 
may be used to detect the presence of androgen receptor mRNA in 
tumor cells. Such probes may also be used for detection of androgen 
receptor gene defects. Androgen receptor complementary DNA 
sequences can be used as hybridization probes to detect
abnormalities in the androgen receptor gene or in its messenger RNA. 
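
The notion of a probe complementary to the receptor message can be sketched in a few lines. The following is only an illustration and not a method taken from this disclosure: the function names and the short demonstration sequences are hypothetical, and hybridization is idealized as exact Watson-Crick pairing over the full length of the probe.

```python
# Watson-Crick pairing table for DNA bases.
_PAIR = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence."""
    dna = dna.upper()
    if set(dna) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G and T")
    return dna.translate(_PAIR)[::-1]


def probe_detects(probe, mrna):
    """True if the DNA probe can pair perfectly with a stretch of the mRNA.

    Idealized assumption: only exact, full-length matches count as
    hybridization; real hybridization tolerates some mismatches.
    """
    target_as_dna = mrna.upper().replace("U", "T")
    return reverse_complement(probe) in target_as_dna


if __name__ == "__main__":
    sense = "ATGGAAGTGCAG"              # hypothetical sense-strand fragment
    probe = reverse_complement(sense)   # antisense probe
    print(probe, probe_detects(probe, "GGAUGGAAGUGCAGCC"))
```

A probe designed this way is antisense to the message, which is what allows it to anneal to mRNA in a hybridization assay.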

Androgen receptor DNA sequences disclosed and complementary RNA 
sequences can be used to construct probes for use in DNA 
hybridization assays. An example of one such hybridization assay 
and methods for constructing probes for such assays are disclosed 
in U.S. Patent No. 4,683,195 to Mullis et al., U.S. Patent No. 
4,683,202 to Mullis, U.S. Patent No. 4,617,261 to Sheldon, III et 
al., U.S. Patent No. 4,683,194 to Saiki et al., and U.S. Patent No.
4,705,886 to Levenson et al., which are hereby incorporated by
reference.

By example, one method for detecting gene deletion utilizes
Southern blotting and hybridization. DNA can be isolated from
cultured skin fibroblasts or from leukocytes obtained from blood.
DNA is cut with restriction enzymes, electrophoresed on an agarose
gel, blotted onto nitrocellulose, and hybridized with
[32P]-labeled androgen receptor DNA (see Maniatis, T. et al.,
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor
Laboratory, 1982, incorporated by reference herein).
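
The logic of detecting a deletion by restriction digestion and probe hybridization can be sketched computationally. The code below is a simplified model and not the laboratory procedure itself: sequences are treated as linear strings, hybridization is idealized as an exact substring match on one strand, and the function names and toy sequences are hypothetical. The default site is the EcoRI recognition sequence GAATTC, cut after the first G.

```python
def digest(seq, site="GAATTC", cut_offset=1):
    """Cut a linear sequence at every occurrence of site.

    cut_offset is the number of bases of the site left on the upstream
    fragment (1 for EcoRI, which cuts G^AATTC).
    """
    fragments = []
    start = 0
    pos = seq.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(seq[start:cut])
        start = cut
        pos = seq.find(site, pos + 1)
    fragments.append(seq[start:])
    return fragments


def probe_bands(seq, probe, site="GAATTC", cut_offset=1):
    """Sizes of the fragments that contain the probe sequence."""
    return sorted(
        len(frag) for frag in digest(seq, site, cut_offset) if probe in frag
    )


def missing_bands(normal_bands, patient_bands):
    """Band sizes seen in the normal pattern but absent from the patient."""
    remaining = list(patient_bands)
    missing = []
    for band in normal_bands:
        if band in remaining:
            remaining.remove(band)
        else:
            missing.append(band)
    return missing


if __name__ == "__main__":
    # Hypothetical toy sequences; the patient lacks the probed region.
    normal = "AAAA" + "GAATTC" + "CCCCCCCC" + "GAATTC" + "TTTT"
    patient = "AAAA" + "GAATTC" + "GAATTC" + "TTTT"
    print(missing_bands(probe_bands(normal, "CCCC"),
                        probe_bands(patient, "CCCC")))
```

A band present in the normal pattern and absent from the patient pattern corresponds to the signature of a deletion on the blot.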

In addition, small mutations can be detected utilizing methods
known to one of ordinary skill in the art, from cultured skin
fibroblasts of the affected individual. A cDNA library can be
prepared using standard techniques. The androgen receptor clones
can be isolated using a [32P]DNA AR probe. The cloned AR cDNA can
then be sequenced and compared to normal AR cDNA sequences.

Alternatively genomic DNA can be isolated from blood leukocytes 
or cultured skin fibroblasts of the affected individual. The DNA is 
then subjected to restriction enzyme digestion, electrophoresis and
is blotted onto nitrocellulose. Synthetic oligonucleotides can be 
used to bracket specific exons. Exon sequences are amplified using 
the polymerase chain reaction, cloned into M13 and sequenced. The 
sequences are compared to normal human AR DNA sequences. 

Another method of identifying small mutations or deletions
takes advantage of the ability of RNase A to cleave regions of
single stranded RNA in RNA:DNA hybrids. Genomic DNA isolated from
fibroblasts of affected individuals is hybridized with radioactive
RNA probes (Promega Biotec) prepared from wild-type androgen
receptor cDNA. Mismatches due to mutations would be cleaved by
RNase A and result in altered sized bands relative to wild-type on
denaturing polyacrylamide gels.
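
How a mismatch translates into altered band sizes can be shown with a deliberately idealized sketch. It assumes that the wild-type probe and the patient sequence are already aligned and of equal length, that every mismatched position is cleaved, and that the mismatched base itself is lost; in practice RNase A does not cleave every mismatch. The function name and toy sequences are hypothetical.

```python
def rnase_a_fragments(wild_type, patient):
    """Sizes of the probe fragments left after cleavage at mismatches.

    Both sequences are written in the same orientation so that position i
    of one is compared with position i of the other (an assumption made
    for simplicity; a real probe is the complementary strand).
    """
    if len(wild_type) != len(patient):
        raise ValueError("sequences must be aligned and of equal length")
    fragments = []
    run = 0  # length of the current perfectly paired stretch
    for wt_base, pt_base in zip(wild_type.upper(), patient.upper()):
        if wt_base == pt_base:
            run += 1
        else:
            if run:
                fragments.append(run)
            run = 0
    if run:
        fragments.append(run)
    return fragments


if __name__ == "__main__":
    wild_type = "ACGTACGTAC"           # hypothetical probe region
    mutant = "ACGTTCGTAC"              # single base change at position 4
    print(rnase_a_fragments(wild_type, wild_type))
    print(rnase_a_fragments(wild_type, mutant))
```

A fully protected probe gives a single band of full length, while a point mutation gives two shorter bands whose sizes indicate the approximate position of the change.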

Restriction fragment length polymorphisms (RFLPs) linked to the
androgen receptor gene locus may be used in prenatal diagnosis and
carrier detection of androgen insensitivity. For example, the
presence of RFLPs in normal individuals is first established by
isolating DNA from lymphocytes of at least six females (total of 12
X chromosomes). DNA can be isolated using the proteinase K
procedure and fragmented using a battery of restriction enzymes.
Preferred are those enzymes that contain the dinucleotide sequence
CG in their recognition sequence. Southern blots are screened with
5-10 kb androgen receptor genomic fragments which if possible lack
repetitive DNA. For those regions containing repetitive elements,
total human genomic DNA can be added as competitor in the
hybridization reaction. Alternatively, one can subclone selected
regions to yield a probe free of repetitive elements.
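
The arithmetic behind the screening design can be made explicit. The sketch below counts X chromosomes in a panel, estimates the chance that an allele of a given frequency appears at least once among them, and filters a list of enzymes for those whose recognition sequence contains CG. It assumes unrelated individuals (independent chromosomes); the function names, the example allele frequency and the short enzyme list are illustrative assumptions, not values from this disclosure.

```python
def x_chromosomes(n_females, n_males=0):
    """Number of X chromosomes sampled: two per female, one per male."""
    return 2 * n_females + n_males


def chance_of_seeing_allele(allele_freq, n_chromosomes):
    """Probability that an allele appears at least once in the sample.

    Assumes the sampled chromosomes are independent draws.
    """
    if not 0.0 <= allele_freq <= 1.0:
        raise ValueError("allele_freq must be between 0 and 1")
    if n_chromosomes < 0:
        raise ValueError("n_chromosomes must be non-negative")
    return 1.0 - (1.0 - allele_freq) ** n_chromosomes


def enzymes_with_cg(enzymes):
    """Names of enzymes whose recognition sequence contains CG."""
    return sorted(
        name for name, site in enzymes.items() if "CG" in site.upper()
    )


if __name__ == "__main__":
    n = x_chromosomes(n_females=6)
    # 0.2 is a hypothetical allele frequency chosen for illustration.
    print(n, round(chance_of_seeing_allele(0.2, n), 3))
    panel = {"TaqI": "TCGA", "MspI": "CCGG",
             "HindIII": "AAGCTT", "EcoRI": "GAATTC"}
    print(enzymes_with_cg(panel))
```

Under these assumptions, the more chromosomes screened, the less likely a polymorphism of moderate frequency is to be missed.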

For example, a human restriction fragment length polymorphism was
determined with cDNA probe B and Hind III restriction endonuclease
using the Southern blot technique (see Figure 8). The two RFLP alleles
detected are a fragment at 6.5 kb (allele 1) and a fragment at 3.5 kb
(allele 2). Major constant fragment bands are seen at approximately
2 and 5 kb with minor constant bands at 0.9 and 7.5 kb. Allele 1 is
present in approximately 30% of the X chromosomes of the Caucasian
population. Allele 2 is present in approximately 20% of the X
chromosomes of the Caucasian population. In Figure 9 Lanes A, B and
D, DNA from women who are homozygous for allele 1 is shown. In
Figure 9 Lane C, DNA from a woman who is heterozygous for
alleles 1 and 2 is shown. Figure 9 Lane E contains DNA from a man
who possesses only allele 2. This RFLP, and others determined by
the clones we have isolated, will enable one to monitor the androgen
receptor gene in various disease conditions described herein.

An example of using the androgen receptor clones to detect
mutations is shown in Figure 9, where DNA from five different
patients with complete androgen insensitivity is digested with EcoRI,
electrophoresed, Southern blotted, and probed with cDNA probe B.
The patient in lane B lacks a 3 kb band, indicating that part of the
androgen receptor gene is deleted. Further analysis of this and
other patients' DNA is possible with other AR probes and by
sequencing by standard methods and comparing the abnormal sequence
to the normal sequence described herein.

Other potential uses for the oligonucleotide sequences disclosed,
for example in construction of therapeutics to block genetic
expression, will be obvious to one of ordinary skill in the art.


